Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 10000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.9 MiB |
| Average record size in memory | 201.0 B |
Variable types
| NUM | 22 |
|---|---|
| CAT | 2 |
| BOOL | 2 |
BILL_AMT2 is highly correlated with BILL_AMT1 and 1 other fields | High correlation |
BILL_AMT1 is highly correlated with BILL_AMT2 | High correlation |
BILL_AMT3 is highly correlated with BILL_AMT2 and 1 other fields | High correlation |
BILL_AMT4 is highly correlated with BILL_AMT3 and 1 other fields | High correlation |
BILL_AMT5 is highly correlated with BILL_AMT4 and 1 other fields | High correlation |
BILL_AMT6 is highly correlated with BILL_AMT5 | High correlation |
PAY_AMT2 is highly skewed (γ1 = 24.47786596) | Skewed |
ID has unique values | Unique |
PAY_0 has 4918 (49.2%) zeros | Zeros |
PAY_2 has 5269 (52.7%) zeros | Zeros |
PAY_3 has 5340 (53.4%) zeros | Zeros |
PAY_4 has 5530 (55.3%) zeros | Zeros |
PAY_5 has 5660 (56.6%) zeros | Zeros |
PAY_6 has 5443 (54.4%) zeros | Zeros |
BILL_AMT1 has 668 (6.7%) zeros | Zeros |
BILL_AMT2 has 929 (9.3%) zeros | Zeros |
BILL_AMT3 has 1022 (10.2%) zeros | Zeros |
BILL_AMT4 has 1162 (11.6%) zeros | Zeros |
BILL_AMT5 has 1267 (12.7%) zeros | Zeros |
BILL_AMT6 has 1430 (14.3%) zeros | Zeros |
PAY_AMT1 has 1837 (18.4%) zeros | Zeros |
PAY_AMT2 has 1823 (18.2%) zeros | Zeros |
PAY_AMT3 has 2018 (20.2%) zeros | Zeros |
PAY_AMT4 has 2210 (22.1%) zeros | Zeros |
PAY_AMT5 has 2324 (23.2%) zeros | Zeros |
PAY_AMT6 has 2423 (24.2%) zeros | Zeros |
Reproduction
| Analysis started | 2020-12-28 14:41:31.688612 |
|---|---|
| Analysis finished | 2020-12-28 14:42:41.000704 |
| Duration | 1 minute and 9.31 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 10000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5000.5 |
|---|---|
| Minimum | 1 |
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 500.95 |
| Q1 | 2500.75 |
| median | 5000.5 |
| Q3 | 7500.25 |
| 95-th percentile | 9500.05 |
| Maximum | 10000 |
| Range | 9999 |
| Interquartile range (IQR) | 4999.5 |
Descriptive statistics
| Standard deviation | 2886.89568 |
|---|---|
| Coefficient of variation (CV) | 0.5773214038 |
| Kurtosis | -1.2 |
| Mean | 5000.5 |
| Median Absolute Deviation (MAD) | 2500 |
| Skewness | 0 |
| Sum | 50005000 |
| Variance | 8334166.667 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 5424 | 1 | < 0.1% | |
| 1338 | 1 | < 0.1% | |
| 7481 | 1 | < 0.1% | |
| 5432 | 1 | < 0.1% | |
| 9526 | 1 | < 0.1% | |
| 3379 | 1 | < 0.1% | |
| 1330 | 1 | < 0.1% | |
| 7473 | 1 | < 0.1% | |
| 9518 | 1 | < 0.1% | |
| Other values (9990) | 9990 | 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10000 | 1 | < 0.1% | |
| 9999 | 1 | < 0.1% | |
| 9998 | 1 | < 0.1% | |
| 9997 | 1 | < 0.1% | |
| 9996 | 1 | < 0.1% |
LIMIT_BAL
Real number (ℝ≥0)
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167871 |
|---|---|
| Minimum | 1000 |
| Maximum | 800000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 20000 |
| Q1 | 50000 |
| median | 140000 |
| Q3 | 240000 |
| 95-th percentile | 440000 |
| Maximum | 800000 |
| Range | 799000 |
| Interquartile range (IQR) | 190000 |
Descriptive statistics
| Standard deviation | 131915.0011 |
|---|---|
| Coefficient of variation (CV) | 0.785811731 |
| Kurtosis | 0.4596630385 |
| Mean | 167871 |
| Median Absolute Deviation (MAD) | 90000 |
| Skewness | 0.9737561444 |
| Sum | 1678710000 |
| Variance | 1.740156752e+10 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 50000 | 1071 | 10.7% | |
| 20000 | 680 | 6.8% | |
| 30000 | 532 | 5.3% | |
| 80000 | 505 | 5.1% | |
| 200000 | 485 | 4.9% | |
| 150000 | 352 | 3.5% | |
| 100000 | 348 | 3.5% | |
| 180000 | 321 | 3.2% | |
| 360000 | 305 | 3.0% | |
| 140000 | 261 | 2.6% | |
| Other values (64) | 5140 | 51.4% |
| Value | Count | Frequency (%) | |
| 1000 | 100 | 1.0% | |
| 10000 | 164 | 1.6% | |
| 20000 | 680 | 6.8% | |
| 30000 | 532 | 5.3% | |
| 40000 | 75 | 0.8% |
| Value | Count | Frequency (%) | |
| 800000 | 1 | < 0.1% | |
| 750000 | 2 | < 0.1% | |
| 740000 | 1 | < 0.1% | |
| 730000 | 1 | < 0.1% | |
| 720000 | 1 | < 0.1% |
SEX
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| 2 | |
|---|---|
| 1 |
| Value | Count | Frequency (%) | |
| 2 | 6023 | 60.2% | |
| 1 | 3977 | 39.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
EDUCATION
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8961 |
|---|---|
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 8 |
| Zeros (%) | 0.1% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8972097238 |
|---|---|
| Coefficient of variation (CV) | 0.4731869225 |
| Kurtosis | 4.600476642 |
| Mean | 1.8961 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.566305956 |
| Sum | 18961 |
| Variance | 0.8049852885 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2 | 4585 | 45.9% | |
| 1 | 3514 | 35.1% | |
| 3 | 1633 | 16.3% | |
| 6 | 124 | 1.2% | |
| 5 | 90 | 0.9% | |
| 4 | 46 | 0.5% | |
| 0 | 8 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 8 | 0.1% | |
| 1 | 3514 | 35.1% | |
| 2 | 4585 | 45.9% | |
| 3 | 1633 | 16.3% | |
| 4 | 46 | 0.5% |
| Value | Count | Frequency (%) | |
| 6 | 124 | 1.2% | |
| 5 | 90 | 0.9% | |
| 4 | 46 | 0.5% | |
| 3 | 1633 | 16.3% | |
| 2 | 4585 | 45.9% |
MARRIAGE
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| 2 | |
|---|---|
| 1 | |
| 3 | 224 |
| 0 | 13 |
| Value | Count | Frequency (%) | |
| 2 | 5287 | 52.9% | |
| 1 | 4476 | 44.8% | |
| 3 | 224 | 2.2% | |
| 0 | 13 | 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
AGE
Real number (ℝ≥0)
| Distinct | 55 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.3308 |
|---|---|
| Minimum | 18 |
| Maximum | 75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 28 |
| median | 34 |
| Q3 | 41 |
| 95-th percentile | 53 |
| Maximum | 75 |
| Range | 57 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 9.373022816 |
|---|---|
| Coefficient of variation (CV) | 0.2652932517 |
| Kurtosis | 0.03356523187 |
| Mean | 35.3308 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.698678083 |
| Sum | 353308 |
| Variance | 87.85355672 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 29 | 533 | 5.3% | |
| 27 | 492 | 4.9% | |
| 28 | 483 | 4.8% | |
| 30 | 479 | 4.8% | |
| 31 | 425 | 4.2% | |
| 26 | 412 | 4.1% | |
| 32 | 380 | 3.8% | |
| 33 | 378 | 3.8% | |
| 24 | 374 | 3.7% | |
| 35 | 374 | 3.7% | |
| Other values (45) | 5670 | 56.7% |
| Value | Count | Frequency (%) | |
| 18 | 100 | 1.0% | |
| 21 | 25 | 0.2% | |
| 22 | 178 | 1.8% | |
| 23 | 308 | 3.1% | |
| 24 | 374 | 3.7% |
| Value | Count | Frequency (%) | |
| 75 | 2 | < 0.1% | |
| 73 | 1 | < 0.1% | |
| 72 | 2 | < 0.1% | |
| 71 | 1 | < 0.1% | |
| 70 | 3 | < 0.1% |
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.0189 |
|---|---|
| Minimum | -2 |
| Maximum | 8 |
| Zeros | 4918 |
| Zeros (%) | 49.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.107729884 |
|---|---|
| Coefficient of variation (CV) | -58.61004679 |
| Kurtosis | 2.15989509 |
| Mean | -0.0189 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.635150905 |
| Sum | -189 |
| Variance | 1.227065497 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 4918 | 49.2% | |
| -1 | 1898 | 19.0% | |
| 1 | 1257 | 12.6% | |
| -2 | 906 | 9.1% | |
| 2 | 878 | 8.8% | |
| 3 | 101 | 1.0% | |
| 4 | 25 | 0.2% | |
| 5 | 7 | 0.1% | |
| 6 | 4 | < 0.1% | |
| 8 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 906 | 9.1% | |
| -1 | 1898 | 19.0% | |
| 0 | 4918 | 49.2% | |
| 1 | 1257 | 12.6% | |
| 2 | 878 | 8.8% |
| Value | Count | Frequency (%) | |
| 8 | 4 | < 0.1% | |
| 7 | 2 | < 0.1% | |
| 6 | 4 | < 0.1% | |
| 5 | 7 | 0.1% | |
| 4 | 25 | 0.2% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.1455 |
|---|---|
| Minimum | -2 |
| Maximum | 7 |
| Zeros | 5269 |
| Zeros (%) | 52.7% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.186031373 |
|---|---|
| Coefficient of variation (CV) | -8.151418369 |
| Kurtosis | 1.378653128 |
| Mean | -0.1455 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.7640536384 |
| Sum | -1455 |
| Variance | 1.406670417 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5269 | 52.7% | |
| -1 | 2033 | 20.3% | |
| -2 | 1264 | 12.6% | |
| 2 | 1259 | 12.6% | |
| 3 | 121 | 1.2% | |
| 4 | 30 | 0.3% | |
| 5 | 8 | 0.1% | |
| 1 | 7 | 0.1% | |
| 6 | 5 | 0.1% | |
| 7 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 1264 | 12.6% | |
| -1 | 2033 | 20.3% | |
| 0 | 5269 | 52.7% | |
| 1 | 7 | 0.1% | |
| 2 | 1259 | 12.6% |
| Value | Count | Frequency (%) | |
| 7 | 4 | < 0.1% | |
| 6 | 5 | 0.1% | |
| 5 | 8 | 0.1% | |
| 4 | 30 | 0.3% | |
| 3 | 121 | 1.2% |
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.186 |
|---|---|
| Minimum | -2 |
| Maximum | 7 |
| Zeros | 5340 |
| Zeros (%) | 53.4% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.17266428 |
|---|---|
| Coefficient of variation (CV) | -6.304646668 |
| Kurtosis | 2.192194996 |
| Mean | -0.186 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.8483908116 |
| Sum | -1860 |
| Variance | 1.375141514 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5340 | 53.4% | |
| -1 | 2006 | 20.1% | |
| -2 | 1346 | 13.5% | |
| 2 | 1182 | 11.8% | |
| 3 | 76 | 0.8% | |
| 4 | 24 | 0.2% | |
| 5 | 11 | 0.1% | |
| 7 | 10 | 0.1% | |
| 6 | 4 | < 0.1% | |
| 1 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 1346 | 13.5% | |
| -1 | 2006 | 20.1% | |
| 0 | 5340 | 53.4% | |
| 1 | 1 | < 0.1% | |
| 2 | 1182 | 11.8% |
| Value | Count | Frequency (%) | |
| 7 | 10 | 0.1% | |
| 6 | 4 | < 0.1% | |
| 5 | 11 | 0.1% | |
| 4 | 24 | 0.2% | |
| 3 | 76 | 0.8% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2311 |
|---|---|
| Minimum | -2 |
| Maximum | 7 |
| Zeros | 5530 |
| Zeros (%) | 55.3% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.153700955 |
|---|---|
| Coefficient of variation (CV) | -4.992215295 |
| Kurtosis | 3.713579895 |
| Mean | -0.2311 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.022541963 |
| Sum | -2311 |
| Variance | 1.331025893 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5530 | 55.3% | |
| -1 | 1934 | 19.3% | |
| -2 | 1424 | 14.2% | |
| 2 | 991 | 9.9% | |
| 3 | 65 | 0.7% | |
| 4 | 24 | 0.2% | |
| 7 | 21 | 0.2% | |
| 5 | 10 | 0.1% | |
| 1 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 1424 | 14.2% | |
| -1 | 1934 | 19.3% | |
| 0 | 5530 | 55.3% | |
| 1 | 1 | < 0.1% | |
| 2 | 991 | 9.9% |
| Value | Count | Frequency (%) | |
| 7 | 21 | 0.2% | |
| 5 | 10 | 0.1% | |
| 4 | 24 | 0.2% | |
| 3 | 65 | 0.7% | |
| 2 | 991 | 9.9% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2717 |
|---|---|
| Minimum | -2 |
| Maximum | 7 |
| Zeros | 5660 |
| Zeros (%) | 56.6% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.122499492 |
|---|---|
| Coefficient of variation (CV) | -4.131393053 |
| Kurtosis | 3.965619239 |
| Mean | -0.2717 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9866710449 |
| Sum | -2717 |
| Variance | 1.260005111 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5660 | 56.6% | |
| -1 | 1864 | 18.6% | |
| -2 | 1506 | 15.1% | |
| 2 | 865 | 8.6% | |
| 3 | 55 | 0.5% | |
| 4 | 26 | 0.3% | |
| 7 | 19 | 0.2% | |
| 5 | 3 | < 0.1% | |
| 6 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 1506 | 15.1% | |
| -1 | 1864 | 18.6% | |
| 0 | 5660 | 56.6% | |
| 2 | 865 | 8.6% | |
| 3 | 55 | 0.5% |
| Value | Count | Frequency (%) | |
| 7 | 19 | 0.2% | |
| 6 | 2 | < 0.1% | |
| 5 | 3 | < 0.1% | |
| 4 | 26 | 0.3% | |
| 3 | 55 | 0.5% |
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.2948 |
|---|---|
| Minimum | -2 |
| Maximum | 7 |
| Zeros | 5443 |
| Zeros (%) | 54.4% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | -2 |
| Q1 | -1 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 7 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.143601216 |
|---|---|
| Coefficient of variation (CV) | -3.879244289 |
| Kurtosis | 3.412360652 |
| Mean | -0.2948 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.9385784289 |
| Sum | -2948 |
| Variance | 1.307823742 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 5443 | 54.4% | |
| -1 | 1918 | 19.2% | |
| -2 | 1628 | 16.3% | |
| 2 | 914 | 9.1% | |
| 3 | 56 | 0.6% | |
| 7 | 15 | 0.1% | |
| 4 | 13 | 0.1% | |
| 6 | 8 | 0.1% | |
| 5 | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| -2 | 1628 | 16.3% | |
| -1 | 1918 | 19.2% | |
| 0 | 5443 | 54.4% | |
| 2 | 914 | 9.1% | |
| 3 | 56 | 0.6% |
| Value | Count | Frequency (%) | |
| 7 | 15 | 0.1% | |
| 6 | 8 | 0.1% | |
| 5 | 5 | 0.1% | |
| 4 | 13 | 0.1% | |
| 3 | 56 | 0.6% |
| Distinct | 8358 |
|---|---|
| Distinct (%) | 83.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50612.1955 |
|---|---|
| Minimum | -154973 |
| Maximum | 653062 |
| Zeros | 668 |
| Zeros (%) | 6.7% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -154973 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3349.5 |
| median | 21527 |
| Q3 | 67139.75 |
| 95-th percentile | 197004.75 |
| Maximum | 653062 |
| Range | 808035 |
| Interquartile range (IQR) | 63790.25 |
Descriptive statistics
| Standard deviation | 73300.75 |
|---|---|
| Coefficient of variation (CV) | 1.448282361 |
| Kurtosis | 9.73669744 |
| Mean | 50612.1955 |
| Median Absolute Deviation (MAD) | 21112 |
| Skewness | 2.675507001 |
| Sum | 506121955 |
| Variance | 5372999950 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 668 | 6.7% | |
| 100 | 101 | 1.0% | |
| 390 | 67 | 0.7% | |
| 326 | 22 | 0.2% | |
| 780 | 21 | 0.2% | |
| 2500 | 20 | 0.2% | |
| 316 | 19 | 0.2% | |
| 396 | 17 | 0.2% | |
| 2400 | 14 | 0.1% | |
| -5 | 11 | 0.1% | |
| Other values (8348) | 9040 | 90.4% |
| Value | Count | Frequency (%) | |
| -154973 | 1 | < 0.1% | |
| -10682 | 1 | < 0.1% | |
| -9095 | 1 | < 0.1% | |
| -8187 | 1 | < 0.1% | |
| -7438 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 653062 | 1 | < 0.1% | |
| 626648 | 1 | < 0.1% | |
| 621749 | 1 | < 0.1% | |
| 613860 | 1 | < 0.1% | |
| 610723 | 1 | < 0.1% |
| Distinct | 8166 |
|---|---|
| Distinct (%) | 81.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49023.6507 |
|---|---|
| Minimum | -67526 |
| Maximum | 671563 |
| Zeros | 929 |
| Zeros (%) | 9.3% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -67526 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2650 |
| median | 20792 |
| Q3 | 64672 |
| 95-th percentile | 192981.85 |
| Maximum | 671563 |
| Range | 739089 |
| Interquartile range (IQR) | 62022 |
Descriptive statistics
| Standard deviation | 71281.03389 |
|---|---|
| Coefficient of variation (CV) | 1.454013173 |
| Kurtosis | 9.807759891 |
| Mean | 49023.6507 |
| Median Absolute Deviation (MAD) | 20476 |
| Skewness | 2.68631414 |
| Sum | 490236507 |
| Variance | 5080985792 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 929 | 9.3% | |
| 390 | 62 | 0.6% | |
| 316 | 27 | 0.3% | |
| 780 | 25 | 0.2% | |
| 326 | 23 | 0.2% | |
| 2500 | 16 | 0.2% | |
| 2400 | 15 | 0.1% | |
| 396 | 14 | 0.1% | |
| -200 | 12 | 0.1% | |
| 416 | 9 | 0.1% | |
| Other values (8156) | 8868 | 88.7% |
| Value | Count | Frequency (%) | |
| -67526 | 1 | < 0.1% | |
| -30000 | 1 | < 0.1% | |
| -26214 | 1 | < 0.1% | |
| -24704 | 1 | < 0.1% | |
| -17810 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 671563 | 1 | < 0.1% | |
| 624475 | 1 | < 0.1% | |
| 597793 | 1 | < 0.1% | |
| 586825 | 1 | < 0.1% | |
| 555086 | 1 | < 0.1% |
| Distinct | 8066 |
|---|---|
| Distinct (%) | 80.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46799.9364 |
|---|---|
| Minimum | -34041 |
| Maximum | 855086 |
| Zeros | 1022 |
| Zeros (%) | 10.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -34041 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2459.25 |
| median | 19688.5 |
| Q3 | 60810.75 |
| 95-th percentile | 185668.7 |
| Maximum | 855086 |
| Range | 889127 |
| Interquartile range (IQR) | 58351.5 |
Descriptive statistics
| Standard deviation | 69673.91968 |
|---|---|
| Coefficient of variation (CV) | 1.488760991 |
| Kurtosis | 12.57272627 |
| Mean | 46799.9364 |
| Median Absolute Deviation (MAD) | 19372.5 |
| Skewness | 2.925391352 |
| Sum | 467999364 |
| Variance | 4854455084 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1022 | 10.2% | |
| 390 | 86 | 0.9% | |
| 316 | 28 | 0.3% | |
| 780 | 21 | 0.2% | |
| 326 | 21 | 0.2% | |
| 396 | 19 | 0.2% | |
| 2400 | 15 | 0.1% | |
| 2500 | 11 | 0.1% | |
| 200 | 10 | 0.1% | |
| 500 | 9 | 0.1% | |
| Other values (8056) | 8758 | 87.6% |
| Value | Count | Frequency (%) | |
| -34041 | 1 | < 0.1% | |
| -24702 | 1 | < 0.1% | |
| -15910 | 1 | < 0.1% | |
| -15641 | 1 | < 0.1% | |
| -15000 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 855086 | 1 | < 0.1% | |
| 689643 | 1 | < 0.1% | |
| 689627 | 1 | < 0.1% | |
| 632041 | 1 | < 0.1% | |
| 597415 | 1 | < 0.1% |
| Distinct | 7928 |
|---|---|
| Distinct (%) | 79.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42601.8695 |
|---|---|
| Minimum | -170000 |
| Maximum | 706864 |
| Zeros | 1162 |
| Zeros (%) | 11.6% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -170000 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1927.5 |
| median | 18758.5 |
| Q3 | 53434.5 |
| 95-th percentile | 171915.5 |
| Maximum | 706864 |
| Range | 876864 |
| Interquartile range (IQR) | 51507 |
Descriptive statistics
| Standard deviation | 64340.97914 |
|---|---|
| Coefficient of variation (CV) | 1.510285344 |
| Kurtosis | 11.81937837 |
| Mean | 42601.8695 |
| Median Absolute Deviation (MAD) | 18368.5 |
| Skewness | 2.894223482 |
| Sum | 426018695 |
| Variance | 4139761597 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1162 | 11.6% | |
| 390 | 69 | 0.7% | |
| 780 | 35 | 0.4% | |
| 316 | 26 | 0.3% | |
| 326 | 18 | 0.2% | |
| 2400 | 16 | 0.2% | |
| 150 | 15 | 0.1% | |
| 396 | 14 | 0.1% | |
| 2500 | 13 | 0.1% | |
| 300 | 10 | 0.1% | |
| Other values (7918) | 8622 | 86.2% |
| Value | Count | Frequency (%) | |
| -170000 | 1 | < 0.1% | |
| -81334 | 1 | < 0.1% | |
| -65167 | 1 | < 0.1% | |
| -34503 | 1 | < 0.1% | |
| -15910 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 706864 | 1 | < 0.1% | |
| 616836 | 1 | < 0.1% | |
| 569034 | 1 | < 0.1% | |
| 541019 | 1 | < 0.1% | |
| 530672 | 1 | < 0.1% |
| Distinct | 7771 |
|---|---|
| Distinct (%) | 77.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39850.1717 |
|---|---|
| Minimum | -37594 |
| Maximum | 551702 |
| Zeros | 1267 |
| Zeros (%) | 12.7% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -37594 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1584.25 |
| median | 17790.5 |
| Q3 | 49799.75 |
| 95-th percentile | 164914.6 |
| Maximum | 551702 |
| Range | 589296 |
| Interquartile range (IQR) | 48215.5 |
Descriptive statistics
| Standard deviation | 60256.79134 |
|---|---|
| Coefficient of variation (CV) | 1.512083607 |
| Kurtosis | 10.25660286 |
| Mean | 39850.1717 |
| Median Absolute Deviation (MAD) | 17400.5 |
| Skewness | 2.7718537 |
| Sum | 398501717 |
| Variance | 3630880903 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1267 | 12.7% | |
| 390 | 69 | 0.7% | |
| 316 | 29 | 0.3% | |
| 780 | 26 | 0.3% | |
| 326 | 19 | 0.2% | |
| 150 | 18 | 0.2% | |
| 396 | 18 | 0.2% | |
| 2400 | 14 | 0.1% | |
| 2500 | 13 | 0.1% | |
| 300 | 11 | 0.1% | |
| Other values (7761) | 8516 | 85.2% |
| Value | Count | Frequency (%) | |
| -37594 | 1 | < 0.1% | |
| -30481 | 1 | < 0.1% | |
| -23003 | 1 | < 0.1% | |
| -20753 | 1 | < 0.1% | |
| -20006 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 551702 | 1 | < 0.1% | |
| 514114 | 1 | < 0.1% | |
| 503914 | 1 | < 0.1% | |
| 501474 | 1 | < 0.1% | |
| 480722 | 1 | < 0.1% |
| Distinct | 7652 |
|---|---|
| Distinct (%) | 76.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38425.6049 |
|---|---|
| Minimum | -339603 |
| Maximum | 568638 |
| Zeros | 1430 |
| Zeros (%) | 14.3% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | -339603 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1053.5 |
| median | 16524.5 |
| Q3 | 48802.5 |
| 95-th percentile | 162110.45 |
| Maximum | 568638 |
| Range | 908241 |
| Interquartile range (IQR) | 47749 |
Descriptive statistics
| Standard deviation | 59510.34968 |
|---|---|
| Coefficient of variation (CV) | 1.548716015 |
| Kurtosis | 10.48931854 |
| Mean | 38425.6049 |
| Median Absolute Deviation (MAD) | 16333 |
| Skewness | 2.744849352 |
| Sum | 384256049 |
| Variance | 3541481719 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1430 | 14.3% | |
| 390 | 60 | 0.6% | |
| 150 | 30 | 0.3% | |
| 316 | 26 | 0.3% | |
| 780 | 18 | 0.2% | |
| 2500 | 17 | 0.2% | |
| 326 | 17 | 0.2% | |
| 416 | 13 | 0.1% | |
| 396 | 13 | 0.1% | |
| -2 | 12 | 0.1% | |
| Other values (7642) | 8364 | 83.6% |
| Value | Count | Frequency (%) | |
| -339603 | 1 | < 0.1% | |
| -150953 | 1 | < 0.1% | |
| -73895 | 1 | < 0.1% | |
| -51183 | 1 | < 0.1% | |
| -45734 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 568638 | 1 | < 0.1% | |
| 527711 | 1 | < 0.1% | |
| 499100 | 1 | < 0.1% | |
| 478034 | 1 | < 0.1% | |
| 472480 | 1 | < 0.1% |
| Distinct | 3815 |
|---|---|
| Distinct (%) | 38.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6015.1521 |
|---|---|
| Minimum | 0 |
| Maximum | 493358 |
| Zeros | 1837 |
| Zeros (%) | 18.4% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 834.5 |
| median | 2100 |
| Q3 | 5022.25 |
| 95-th percentile | 19689.4 |
| Maximum | 493358 |
| Range | 493358 |
| Interquartile range (IQR) | 4187.75 |
Descriptive statistics
| Standard deviation | 17944.06816 |
|---|---|
| Coefficient of variation (CV) | 2.983144542 |
| Kurtosis | 160.025536 |
| Mean | 6015.1521 |
| Median Absolute Deviation (MAD) | 2050 |
| Skewness | 10.61281748 |
| Sum | 60151521 |
| Variance | 321989582.1 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1837 | 18.4% | |
| 2000 | 453 | 4.5% | |
| 3000 | 276 | 2.8% | |
| 5000 | 237 | 2.4% | |
| 1500 | 173 | 1.7% | |
| 4000 | 146 | 1.5% | |
| 10000 | 138 | 1.4% | |
| 1000 | 111 | 1.1% | |
| 2500 | 88 | 0.9% | |
| 6000 | 84 | 0.8% | |
| Other values (3805) | 6457 | 64.6% |
| Value | Count | Frequency (%) | |
| 0 | 1837 | 18.4% | |
| 1 | 3 | < 0.1% | |
| 2 | 3 | < 0.1% | |
| 3 | 8 | 0.1% | |
| 4 | 7 | 0.1% |
| Value | Count | Frequency (%) | |
| 493358 | 1 | < 0.1% | |
| 368199 | 1 | < 0.1% | |
| 304815 | 1 | < 0.1% | |
| 302000 | 1 | < 0.1% | |
| 300039 | 1 | < 0.1% |
| Distinct | 3755 |
|---|---|
| Distinct (%) | 37.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6141.2774 |
|---|---|
| Minimum | 0 |
| Maximum | 1227082 |
| Zeros | 1823 |
| Zeros (%) | 18.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 799.75 |
| median | 2009.5 |
| Q3 | 5000 |
| 95-th percentile | 19000.1 |
| Maximum | 1227082 |
| Range | 1227082 |
| Interquartile range (IQR) | 4200.25 |
Descriptive statistics
| Standard deviation | 24637.56804 |
|---|---|
| Coefficient of variation (CV) | 4.011798594 |
| Kurtosis | 964.5053804 |
| Mean | 6141.2774 |
| Median Absolute Deviation (MAD) | 1990.5 |
| Skewness | 24.47786596 |
| Sum | 61412774 |
| Variance | 607009759 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1823 | 18.2% | |
| 2000 | 418 | 4.2% | |
| 3000 | 283 | 2.8% | |
| 5000 | 264 | 2.6% | |
| 1000 | 172 | 1.7% | |
| 1500 | 161 | 1.6% | |
| 4000 | 139 | 1.4% | |
| 10000 | 106 | 1.1% | |
| 6000 | 89 | 0.9% | |
| 2500 | 83 | 0.8% | |
| Other values (3745) | 6462 | 64.6% |
| Value | Count | Frequency (%) | |
| 0 | 1823 | 18.2% | |
| 1 | 2 | < 0.1% | |
| 2 | 6 | 0.1% | |
| 3 | 4 | < 0.1% | |
| 4 | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| 1227082 | 1 | < 0.1% | |
| 1024516 | 1 | < 0.1% | |
| 580464 | 1 | < 0.1% | |
| 415552 | 1 | < 0.1% | |
| 401003 | 1 | < 0.1% |
| Distinct | 3598 |
|---|---|
| Distinct (%) | 36.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4962.3974 |
|---|---|
| Minimum | 0 |
| Maximum | 400972 |
| Zeros | 2018 |
| Zeros (%) | 20.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 390 |
| median | 1773.5 |
| Q3 | 4496.25 |
| 95-th percentile | 16936.35 |
| Maximum | 400972 |
| Range | 400972 |
| Interquartile range (IQR) | 4106.25 |
Descriptive statistics
| Standard deviation | 14615.95517 |
|---|---|
| Coefficient of variation (CV) | 2.945341534 |
| Kurtosis | 190.477759 |
| Mean | 4962.3974 |
| Median Absolute Deviation (MAD) | 1773.5 |
| Skewness | 11.17283726 |
| Sum | 49623974 |
| Variance | 213626145.5 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2018 | 20.2% | |
| 2000 | 422 | 4.2% | |
| 1000 | 365 | 3.6% | |
| 3000 | 297 | 3.0% | |
| 5000 | 233 | 2.3% | |
| 1500 | 162 | 1.6% | |
| 4000 | 133 | 1.3% | |
| 10000 | 98 | 1.0% | |
| 1200 | 86 | 0.9% | |
| 6000 | 65 | 0.7% | |
| Other values (3588) | 6121 | 61.2% |
| Value | Count | Frequency (%) | |
| 0 | 2018 | 20.2% | |
| 1 | 2 | < 0.1% | |
| 2 | 6 | 0.1% | |
| 3 | 3 | < 0.1% | |
| 4 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 400972 | 1 | < 0.1% | |
| 397092 | 1 | < 0.1% | |
| 310852 | 1 | < 0.1% | |
| 234456 | 1 | < 0.1% | |
| 232242 | 1 | < 0.1% |
| Distinct | 3355 |
|---|---|
| Distinct (%) | 33.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4963.5525 |
|---|---|
| Minimum | 0 |
| Maximum | 528897 |
| Zeros | 2210 |
| Zeros (%) | 22.1% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 218.75 |
| median | 1500 |
| Q3 | 4000 |
| 95-th percentile | 17002.8 |
| Maximum | 528897 |
| Range | 528897 |
| Interquartile range (IQR) | 3781.25 |
Descriptive statistics
| Standard deviation | 17275.45798 |
|---|---|
| Coefficient of variation (CV) | 3.480462427 |
| Kurtosis | 282.2000045 |
| Mean | 4963.5525 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 13.64627263 |
| Sum | 49635525 |
| Variance | 298441448.4 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2210 | 22.1% | |
| 1000 | 439 | 4.4% | |
| 2000 | 376 | 3.8% | |
| 3000 | 305 | 3.0% | |
| 5000 | 258 | 2.6% | |
| 1500 | 150 | 1.5% | |
| 4000 | 130 | 1.3% | |
| 10000 | 88 | 0.9% | |
| 6000 | 83 | 0.8% | |
| 500 | 75 | 0.8% | |
| Other values (3345) | 5886 | 58.9% |
| Value | Count | Frequency (%) | |
| 0 | 2210 | 22.1% | |
| 1 | 8 | 0.1% | |
| 2 | 7 | 0.1% | |
| 3 | 4 | < 0.1% | |
| 4 | 6 | 0.1% |
| Value | Count | Frequency (%) | |
| 528897 | 1 | < 0.1% | |
| 497000 | 1 | < 0.1% | |
| 432130 | 1 | < 0.1% | |
| 400046 | 1 | < 0.1% | |
| 331788 | 1 | < 0.1% |
| Distinct | 3279 |
|---|---|
| Distinct (%) | 32.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4843.7145 |
|---|---|
| Minimum | 0 |
| Maximum | 417990 |
| Zeros | 2324 |
| Zeros (%) | 23.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 165 |
| median | 1500 |
| Q3 | 4018.75 |
| 95-th percentile | 15701.3 |
| Maximum | 417990 |
| Range | 417990 |
| Interquartile range (IQR) | 3853.75 |
Descriptive statistics
| Standard deviation | 15934.92723 |
|---|---|
| Coefficient of variation (CV) | 3.28981554 |
| Kurtosis | 164.1611901 |
| Mean | 4843.7145 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 10.83744383 |
| Sum | 48437145 |
| Variance | 253921905.9 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2324 | 23.2% | |
| 1000 | 423 | 4.2% | |
| 2000 | 419 | 4.2% | |
| 5000 | 293 | 2.9% | |
| 3000 | 291 | 2.9% | |
| 1500 | 147 | 1.5% | |
| 4000 | 146 | 1.5% | |
| 10000 | 108 | 1.1% | |
| 2500 | 81 | 0.8% | |
| 500 | 80 | 0.8% | |
| Other values (3269) | 5688 | 56.9% |
| Value | Count | Frequency (%) | |
| 0 | 2324 | 23.2% | |
| 1 | 4 | < 0.1% | |
| 2 | 6 | 0.1% | |
| 3 | 4 | < 0.1% | |
| 4 | 4 | < 0.1% |
| Value | Count | Frequency (%) | |
| 417990 | 1 | < 0.1% | |
| 332000 | 1 | < 0.1% | |
| 326889 | 1 | < 0.1% | |
| 302823 | 1 | < 0.1% | |
| 284069 | 1 | < 0.1% |
| Distinct | 3293 |
|---|---|
| Distinct (%) | 32.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5082.9775 |
|---|---|
| Minimum | 0 |
| Maximum | 527143 |
| Zeros | 2423 |
| Zeros (%) | 24.2% |
| Memory size | 78.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 100 |
| median | 1500 |
| Q3 | 4003.5 |
| 95-th percentile | 17242.5 |
| Maximum | 527143 |
| Range | 527143 |
| Interquartile range (IQR) | 3903.5 |
Descriptive statistics
| Standard deviation | 16619.73763 |
|---|---|
| Coefficient of variation (CV) | 3.26968546 |
| Kurtosis | 193.7215196 |
| Mean | 5082.9775 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 11.00257951 |
| Sum | 50829775 |
| Variance | 276215678.8 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 2423 | 24.2% | |
| 2000 | 423 | 4.2% | |
| 1000 | 402 | 4.0% | |
| 3000 | 295 | 2.9% | |
| 5000 | 260 | 2.6% | |
| 4000 | 152 | 1.5% | |
| 1500 | 138 | 1.4% | |
| 10000 | 115 | 1.1% | |
| 500 | 90 | 0.9% | |
| 2500 | 67 | 0.7% | |
| Other values (3283) | 5635 | 56.4% |
| Value | Count | Frequency (%) | |
| 0 | 2423 | 24.2% | |
| 1 | 3 | < 0.1% | |
| 2 | 3 | < 0.1% | |
| 3 | 4 | < 0.1% | |
| 4 | 3 | < 0.1% |
| Value | Count | Frequency (%) | |
| 527143 | 1 | < 0.1% | |
| 372495 | 1 | < 0.1% | |
| 345293 | 1 | < 0.1% | |
| 254000 | 1 | < 0.1% | |
| 250400 | 1 | < 0.1% |
default
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 9.8 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) | |
| False | 7762 | 77.6% | |
| True | 2238 | 22.4% |
PAY_1
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.1 KiB |
| 1 | |
|---|---|
| 0 | 100 |
| Value | Count | Frequency (%) | |
| 1 | 9900 | 99.0% | |
| 0 | 100 | 1.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| ID | LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | default | PAY_1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 50000.0 | 2 | 3 | 2 | 23 | 2 | 0 | 0 | 0 | 0 | 0 | 50653.0 | 49348.0 | 47995.0 | 40226.0 | 27828.0 | 28411.0 | 2190.0 | 2027.0 | 2204.0 | 996.0 | 1031.0 | 1047.0 | False | 1.0 |
| 1 | 2 | 10000.0 | 1 | 3 | 2 | 25 | 0 | 0 | 0 | 0 | 0 | -1 | 8525.0 | 5141.0 | 5239.0 | 7911.0 | 17890.0 | 10000.0 | 1500.0 | 5000.0 | 4000.0 | 2000.0 | 22400.0 | 0.0 | False | 1.0 |
| 2 | 3 | 150000.0 | 1 | 3 | 1 | 52 | 0 | 0 | 0 | 0 | 0 | 0 | 88812.0 | 90649.0 | 92499.0 | 94364.0 | 97589.0 | 99921.0 | 2564.0 | 2616.0 | 2647.0 | 4000.0 | 3158.0 | 2215.0 | True | 1.0 |
| 3 | 4 | 280000.0 | 2 | 2 | 2 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 25989.0 | 27052.0 | 28111.0 | 29138.0 | 29852.0 | 30717.0 | 1800.0 | 1800.0 | 1800.0 | 1500.0 | 1500.0 | 1500.0 | False | 1.0 |
| 4 | 5 | 360000.0 | 2 | 1 | 1 | 41 | -2 | -2 | -2 | -2 | -2 | -2 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | True | 1.0 |
| 5 | 6 | 210000.0 | 2 | 2 | 2 | 27 | -1 | -1 | -1 | 0 | 0 | 2 | 5353.0 | 280.0 | 4609.0 | 4703.0 | 6324.0 | 1621.0 | 280.0 | 4609.0 | 94.0 | 1621.0 | 0.0 | 32.0 | False | 1.0 |
| 6 | 7 | 120000.0 | 1 | 1 | 1 | 51 | 1 | -2 | -2 | -1 | 0 | -1 | 0.0 | -416.0 | -1248.0 | 832.0 | 416.0 | 1398.0 | 0.0 | 0.0 | 2080.0 | 0.0 | 1398.0 | 0.0 | True | 1.0 |
| 7 | 8 | 120000.0 | 2 | 3 | 1 | 39 | 1 | 2 | 2 | 2 | 2 | 2 | 69830.0 | 68108.0 | 70415.0 | 74443.0 | 75527.0 | 77171.0 | 0.0 | 3400.0 | 5800.0 | 2900.0 | 3000.0 | 3100.0 | False | 1.0 |
| 8 | 9 | 70000.0 | 2 | 3 | 1 | 27 | 2 | 2 | 2 | 2 | 2 | 2 | 27241.0 | 30416.0 | 29628.0 | 32350.0 | 33218.0 | 32532.0 | 3628.0 | 0.0 | 3218.0 | 1532.0 | 0.0 | 2257.0 | True | 1.0 |
| 9 | 10 | 170000.0 | 1 | 1 | 2 | 26 | -1 | -1 | -1 | -1 | -1 | -1 | 23594.0 | 1512.0 | 1362.0 | 1591.0 | 3524.0 | 8545.0 | 1512.0 | 1362.0 | 1591.0 | 3524.0 | 8545.0 | 1485.0 | False | 1.0 |
Last rows
| ID | LIMIT_BAL | SEX | EDUCATION | MARRIAGE | AGE | PAY_0 | PAY_2 | PAY_3 | PAY_4 | PAY_5 | PAY_6 | BILL_AMT1 | BILL_AMT2 | BILL_AMT3 | BILL_AMT4 | BILL_AMT5 | BILL_AMT6 | PAY_AMT1 | PAY_AMT2 | PAY_AMT3 | PAY_AMT4 | PAY_AMT5 | PAY_AMT6 | default | PAY_1 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 9990 | 9991 | 80000.0 | 2 | 2 | 2 | 28 | 0 | 0 | 0 | 0 | 0 | 0 | 79688.0 | 78165.0 | 68579.0 | 50312.0 | 44277.0 | 44488.0 | 2800.0 | 2398.0 | 2000.0 | 2000.0 | 1700.0 | 2005.0 | False | 1.0 |
| 9991 | 9992 | 100000.0 | 2 | 3 | 1 | 24 | -1 | -1 | -1 | -1 | -1 | -2 | 440.0 | 470.0 | 470.0 | 470.0 | 0.0 | 0.0 | 500.0 | 470.0 | 470.0 | 0.0 | 0.0 | 0.0 | False | 1.0 |
| 9992 | 9993 | 90000.0 | 2 | 2 | 2 | 24 | -1 | 0 | 0 | 0 | 0 | 0 | 17524.0 | 22184.0 | 23270.0 | 27372.0 | 31905.0 | 34954.0 | 5000.0 | 5000.0 | 5000.0 | 5000.0 | 3600.0 | 2000.0 | False | 1.0 |
| 9993 | 9994 | 20000.0 | 1 | 2 | 2 | 27 | 0 | 0 | 0 | 0 | 0 | 0 | 18860.0 | 17904.0 | 15300.0 | 12776.0 | 3585.0 | 0.0 | 1360.0 | 1206.0 | 256.0 | 72.0 | 0.0 | 0.0 | False | 1.0 |
| 9994 | 9995 | 10000.0 | 1 | 1 | 2 | 26 | -1 | 2 | -1 | -1 | -1 | -2 | 10252.0 | 5677.0 | 2735.0 | 4564.0 | 0.0 | 0.0 | 0.0 | 2735.0 | 4564.0 | 0.0 | 0.0 | 0.0 | False | 1.0 |
| 9995 | 9996 | 130000.0 | 1 | 1 | 2 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 23292.0 | 14077.0 | 15546.0 | 108047.0 | 93708.0 | 97353.0 | 3000.0 | 2000.0 | 93000.0 | 4000.0 | 5027.0 | 4005.0 | False | 1.0 |
| 9996 | 9997 | 100000.0 | 1 | 1 | 1 | 35 | 1 | 2 | -1 | -1 | 0 | 0 | 3515.0 | 2975.0 | 2342.0 | 12016.0 | 10203.0 | 5323.0 | 10.0 | 3141.0 | 12021.0 | 135.0 | 507.0 | 6.0 | False | 1.0 |
| 9997 | 9998 | 280000.0 | 1 | 1 | 1 | 30 | 0 | 0 | 0 | 0 | 0 | 0 | 166037.0 | 166291.0 | 162992.0 | 134154.0 | 161057.0 | 167490.0 | 12126.0 | 39102.0 | 5000.0 | 30000.0 | 10000.0 | 10000.0 | False | 1.0 |
| 9998 | 9999 | 170000.0 | 2 | 2 | 2 | 27 | 2 | 0 | 0 | 0 | 0 | 0 | 173577.0 | 171480.0 | 171794.0 | 166637.0 | 169021.0 | 164531.0 | 6500.0 | 7100.0 | 6100.0 | 6000.0 | 5600.0 | 7700.0 | True | 1.0 |
| 9999 | 10000 | 150000.0 | 2 | 1 | 2 | 27 | 2 | 2 | 2 | 2 | 0 | 0 | 15475.0 | 18212.0 | 18617.0 | 18020.0 | 18726.0 | 19414.0 | 3000.0 | 1000.0 | 0.0 | 1000.0 | 1000.0 | 1000.0 | True | 1.0 |